Semantic principal video shot classification via mixture Gaussian
نویسندگان
چکیده
As digital cameras become more affordable, digital video now plays an important role in medical education and healthcare. In this paper, we propose a novel framework to facilitate semantic classification of surgery education videos. Specifically, the framework includes: (a) Semantic-sensitive video content characterization via principal video shots. (b) Semantic video classification via a mixture Gaussian model to bridge the semantic gap bwteen low-level visual features and semantic visual concepts in a specific surgery education video domain.
منابع مشابه
Semantic video classification with insufficient labeled samples
To support more effective video retrieval at semantic level, we introduce a novel framework to achieve semantic video classification. This novel framework includes: (a) A semantic-senstive video content representation framework via principal video shots to enhance the quality of features (i.e., the ability of the selected low-level multimodal perceptual features to discriminate among various se...
متن کاملTokyoTechCanon at TRECVID 2012
We aim at developing a high-performance semantic indexing system using Gaussian-mixture-model (GMM) supervectors and tree-structured GMMs [1, 2, 3]. GMM supervectors corresponding to six types of audio and visual features are extracted from video shots. Tree-structured GMMs reduce the computational cost of maximum a posteriori (MAP) adaptation for estimating GMM parameters while keeping accurac...
متن کاملSemantic Shot Classification in Sports Video
In this paper, we present a unified framework for semantic shot classification in sports videos. Unlike previous approaches, which focus on clustering by aggregating shots with similar low-level features, the proposed scheme makes use of domain knowledge of specific sport to perform a top-down video shot classification, including identification of video shot classes for each sport, and supervis...
متن کاملRecognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model
Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....
متن کاملGlobal Journal of Computer Science and Technology
Rapid growth in multimedia technologies facilitates the acquisition and storage of videos in a cost effective manner; leads to the processing of ginormous videos. However, for effective processing, suitable search methodologies are essential pre-requisite in any video processing system. In this paper, we propose a proficient content-based video retrieval system with the aid of extensive feature...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003